Using Web Data to Investigate Antonym Canonicity

نویسندگان

Steven Jones

M. Lynne Murphy

Carita Paradis

Caroline Willners

چکیده

In the literature, some researchers (e.g. Gross, Fischer and Miller 1989, Charles, Reed and Derryberry 1994) treat antonym pairs as either canonical (for example old/young, cold/hot and happy/sad) or non-canonical (aged/youthful, cool/hot, happy/miserable), while others assume or argue for a continuum between the two categories (e.g. Herrmann, Chaffin, Conti, Peters and Robbins 1979, Murphy 2003). Among the methods that have been used to investigate antonym canonicity are word association tests (Deese 1965, Clark 1970), judgement tests (Herrmann, Chaffin, Daniel and Wool 1986) and elicitation experiments (Paradis, Willners, Murphy and Jones, forth.). This paper approaches the issue by building specifically on research that has demonstrated the tendency of antonyms to favour certain lexico-grammatical constructions in discourse, such as both X and Y, from X to Y and whether X or Y (Justeson and Katz 1991, Mettinger 1994, Fellbaum 1995, Jones 2002). We argue that a language’s most canonical antonym pairs can reasonably be expected to co-occur with highest fidelity in such constructions (fidelity here refers to the tendency of words to co-occur with each other, in preference to other semantically plausible pairings, across the widest possible range of appropriate contexts) and that, given their relatively low frequency in language, an extremely large corpus is needed in order to identify such patterns. The specific aims of this paper are therefore (a) to assess the degree to which a series of lexico-grammatical constructions can be used as a diagnostic of antonymy; (b) to measure the strength of antonym pairs belonging to ten semantic scales by examining their co-occurrence fidelity within these constructions; and (c) to evaluate the usefulness of the World Wide Web as a corpus for research into certain types of low-frequency phenomena in language. In general, studies into antonym canonicity have been based on either the results of metalinguistic activities or on corpus-based searches. To begin with the former, it has been noted that “language users can intuitively sort ‘good’ (or prototypical) antonyms from not-so-good ones and downright bad ones” (Murphy 2003:11). This is often referred to as the “clang phenomenon” – a term used to describe the reaction to those pairs that intuitively strike the hearer as being good ‘opposites’ (Charles and Miller 1989,

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Quantifying aspects of antonym canonicity in English and Swedish: textual and experimental

This paper highlights the potential usefulness of combining corpus methods and experimental methods to gain new theoretical insights into the role of antonymy as an organizing lexicosemantic principle in human thinking and languages’ vocabularies. We are intrigued by what distinguishes so-called canonical antonyms such as good-bad, long-short, thin-thick from other types of contrasts such as co...

متن کامل

Antonymy and Canonicity: Experimental and Distributional Evidence

The present paper investigates the phenomenon of antonym canonicity by providing new behavioural and distributional evidence on Italian adjectives. Previous studies have showed that some pairs of antonyms are perceived to be better examples of opposition than others, and are so considered representative of the whole category (e.g., Deese, 1964; Murphy, 2003; Paradis et al., 2009). Our goal is t...

متن کامل

The Comparative Effect of Antonym in-Text Glosses and Description in-Text Glosses on EFL Learners' Reading Comprehensio

The present study was carried out to investigate the comparative effect of antonym in-text glosses and description in-text glosses on a group of Iranian EFL learners' reading comprehension. To fulfill the purpose of this study, 60 female intermediate students between 18 and 19 years old were selected among a total number of 90 through their performance on a piloted PET. These 60 participants we...

متن کامل

Word Embedding-based Antonym Detection using Thesauri and Distributional Information

This paper proposes a novel approach to train word embeddings to capture antonyms. Word embeddings have shown to capture synonyms and analogies. Such word embeddings, however, cannot capture antonyms since they depend on the distributional hypothesis. Our approach utilizes supervised synonym and antonym information from thesauri, as well as distributional information from large-scale unlabelled...

متن کامل

Canonicity results for mu-calculi: an algorithmic approach

We investigate the canonicity of inequalities of the intuitionistic mu-calculus. The notion of canonicity in the presence of fixed point operators is not entirely straightforward. In the algebraic setting of canonical extensions we examine both the usual notion of canonicity and what we will call tame canonicity. This latter concept has previously been investigated for the classical mu-calculus...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2007

Using Web Data to Investigate Antonym Canonicity

نویسندگان

چکیده

منابع مشابه

Quantifying aspects of antonym canonicity in English and Swedish: textual and experimental

Antonymy and Canonicity: Experimental and Distributional Evidence

The Comparative Effect of Antonym in-Text Glosses and Description in-Text Glosses on EFL Learners' Reading Comprehensio

Word Embedding-based Antonym Detection using Thesauri and Distributional Information

Canonicity results for mu-calculi: an algorithmic approach

عنوان ژورنال:

اشتراک گذاری